NSF PAR Search | NSF Public Access Repository

Efficient Evaluation Algorithms for Sound Event Detection

Lostanlen, Vincent; McFee, Brian (September 2023, Proceedings of the 8th Detection and Classification of Acoustic Scenes and Events 2023 Workshop (DCASE2023))

Full Text Available
Self-Calibrating Acoustic Sensor Networks with Per-Channel Energy Normalization

Lostanlen, Vincent (January 2021, Euronoise)

Full Text Available
Chirping up the Right Tree: Incorporating Biological Taxonomies into Deep Bioacoustic Classifiers

https://doi.org/10.1109/ICASSP40776.2020.9052908

Cramer, Jason; Lostanlen, Vincent; Farnsworth, Andrew; Salamon, Justin; Bello, Juan Pablo (May 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP))
Class imbalance in the training data hinders the generalization ability of machine listening systems. In the context of bioacoustics, this issue may be circumvented by aggregating species labels into super-groups of higher taxonomic rank: genus, family, order, and so forth. However, different applications of machine listening to wildlife monitoring may require different levels of granularity. This paper introduces TaxoNet, a deep neural network for structured classification of signals from living organisms. TaxoNet is trained as a multitask and multilabel model, following a new architectural principle in end-to-end learning named "hierarchical composition": shallow layers extract a shared representation to predict a root taxon, while deeper layers specialize recursively to lower-rank taxa. In this way, TaxoNet is capable of handling taxonomic uncertainty, out-of-vocabulary labels, and open-set deployment settings. An experimental benchmark on two new bioacoustic datasets (ANAFCC and BirdVox-14SD) leads to state-of-the-art results in bird species classification. Furthermore, on a task of coarse-grained classification, TaxoNet also outperforms a flat single-task model trained on aggregate labels.
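The label aggregation mentioned in this abstract (species grouped into genus, family, and order) can be illustrated with a minimal sketch. This is not the TaxoNet implementation: the `TAXONOMY` table, the label counts, and the helper names `multilevel_target` and `aggregate` are assumptions made for illustration only, and the species listed are not claimed to come from ANAFCC or BirdVox-14SD.

```python
from collections import Counter

# Illustrative taxonomy (assumption): species -> (genus, family, order).
TAXONOMY = {
    "Catharus ustulatus": ("Catharus", "Turdidae", "Passeriformes"),
    "Catharus guttatus": ("Catharus", "Turdidae", "Passeriformes"),
    "Turdus migratorius": ("Turdus", "Turdidae", "Passeriformes"),
    "Setophaga striata": ("Setophaga", "Parulidae", "Passeriformes"),
}
RANKS = ("genus", "family", "order")


def multilevel_target(species):
    """Return one label per taxonomic rank, as a multitask model would need."""
    target = {"species": species}
    target.update(zip(RANKS, TAXONOMY[species]))
    return target


def aggregate(species_labels, rank):
    """Count training examples after merging species into the given rank."""
    idx = RANKS.index(rank)
    return Counter(TAXONOMY[s][idx] for s in species_labels)


# Hypothetical, deliberately imbalanced set of species labels.
labels = (
    ["Catharus ustulatus"] * 40
    + ["Catharus guttatus"] * 3
    + ["Turdus migratorius"] * 30
    + ["Setophaga striata"] * 2
)

for rank in RANKS:
    print(rank, dict(aggregate(labels, rank)))
```

In this toy set, the three examples of Catharus guttatus are absorbed into a 43-example genus class, which is the kind of coarsening the abstract describes. The price is a loss of granularity, which is why a model that predicts all ranks at once can be useful.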
Full Text Available
Learning the Helix Topology of Musical Pitch

https://doi.org/10.1109/ICASSP40776.2020.9053644

Lostanlen, Vincent; Sridhar, Sripathi; McFee, Brian; Farnsworth, Andrew; Bello, Juan Pablo (May 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP))
To explain the consonance of octaves, music psychologists represent pitch as a helix where azimuth and axial coordinate correspond to pitch class and pitch height respectively. This article addresses the problem of discovering this helical structure from unlabeled audio data. We measure Pearson correlations in the constant-Q transform (CQT) domain to build a K-nearest neighbor graph between frequency subbands. Then, we run the Isomap manifold learning algorithm to represent this graph in a three-dimensional space in which straight lines approximate graph geodesics. Experiments on isolated musical notes demonstrate that the resulting manifold resembles a helix which makes a full turn at every octave. A circular shape is also found in English speech, but not in urban noise. We discuss the impact of various design choices on the visualization: instrumentarium, loudness mapping function, and number of neighbors K.
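The first stages of the pipeline described in this abstract (Pearson correlations between subbands, a K-nearest neighbor graph, and graph geodesics) can be sketched in plain Python. This is a sketch under stated assumptions, not the authors' code: the input is synthetic sinusoids rather than constant-Q transform magnitudes, the edge weight `1 - correlation` is an assumed dissimilarity, and the function names are hypothetical. The final Isomap step, which embeds the geodesic distances in three dimensions, is left out.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def knn_graph(subbands, k):
    """Symmetric K-nearest neighbor graph; weights are 1 - correlation (assumption)."""
    n = len(subbands)
    dissim = [[1.0 - pearson(subbands[i], subbands[j]) for j in range(n)]
              for i in range(n)]
    graph = {i: {} for i in range(n)}
    for i in range(n):
        others = (j for j in range(n) if j != i)
        for j in sorted(others, key=lambda j: dissim[i][j])[:k]:
            graph[i][j] = dissim[i][j]
            graph[j][i] = dissim[i][j]  # keep the graph undirected
    return graph


def geodesics(graph):
    """All-pairs shortest path lengths (Floyd-Warshall) on the graph."""
    n = len(graph)
    dist = [[math.inf] * n for _ in range(n)]
    for i in range(n):
        dist[i][i] = 0.0
        for j, weight in graph[i].items():
            dist[i][j] = weight
    for m in range(n):
        for i in range(n):
            for j in range(n):
                if dist[i][m] + dist[m][j] < dist[i][j]:
                    dist[i][j] = dist[i][m] + dist[m][j]
    return dist


# Synthetic stand-in for subband activations (assumption): 8 sinusoids whose
# phase grows by DELTA per subband, so correlation is cos of the phase gap.
T, DELTA = 64, 0.3
toy = [[math.sin(2 * math.pi * t / T + i * DELTA) for t in range(T)]
       for i in range(8)]

graph = knn_graph(toy, k=2)
geo = geodesics(graph)
print(sorted(graph[3]), round(geo[0][7], 4))
```

On this toy input, each interior subband is linked to its two immediate neighbors, and the geodesic between the first and last subbands follows the chain rather than the direct dissimilarity. An Isomap embedding would then place the points so that straight-line distances approximate these geodesics.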
Full Text Available
One or Two Frequencies? The Scattering Transform Answers

https://doi.org/10.23919/Eusipco47968.2020.9287216

Lostanlen, Vincent; Cohen-Hadria, Alice; Bello, Juan Pablo (January 2020, 28th European Signal Processing Conference (EUSIPCO))

Full Text Available
Fourier at the heart of computer music: From harmonic sounds to texture

https://doi.org/10.1016/j.crhy.2019.07.005

Lostanlen, Vincent; Andén, Joakim; Lagrange, Mathieu (September 2019, Comptes Rendus Physique)

Full Text Available
Joint Time–Frequency Scattering

https://doi.org/10.1109/TSP.2019.2918992

Andén, Joakim; Lostanlen, Vincent; Mallat, Stéphane (July 2019, IEEE Transactions on Signal Processing)

Full Text Available
Une ou deux composantes ? La réponse de la diffusion en ondelettes [One or Two Components? The Answer from Wavelet Scattering]

Lostanlen, Vincent (January 2019, Groupe d'Études en Traitement du Signal et Images (GRETSI) workshop)

Full Text Available
Robust sound event detection in bioacoustic sensor networks

https://doi.org/10.1371/journal.pone.0214168

Lostanlen, Vincent; Salamon, Justin; Farnsworth, Andrew; Kelling, Steve; Bello, Juan Pablo (October 2019, PLOS ONE)
McLoughlin, Ian (Ed.)
Full Text Available
Hybrid scattering-LSTM networks for automated detection of sleep arousals

https://doi.org/10.1088/1361-6579/ab2664

Warrick, Philip A.; Lostanlen, Vincent; Nabhan Homsi, Masun (July 2019, Physiological Measurement)

Full Text Available
